A 7.3 M Output Non-Zeros/J, 11.7 M Output Non-Zeros/GB Reconfigurable Sparse Matrix–Matrix Multiplication Accelerator

نویسندگان
چکیده

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Output Consensus Control of Nonlinear Non-minimum Phase Multi-agent Systems Using Output Redefinition Method

This paper concerns the problem of output consensus in nonlinear non-minimum phase systems. The main contribution of the paper is to guarantee achieving consensus in the presence of unstable zero dynamics. To achieve this goal, an output redefinition method is proposed. The new outputs of agents are functions of original outputs and internal states and defined such that the dynamics of agents a...

متن کامل

The Input/Output Complexity of Sparse Matrix Multiplication

We consider the problem of multiplying sparse matrices (over a semiring) where the number of non-zero entries is larger than main memory. In the classical paper of Hong and Kung (STOC ’81) it was shown that to compute a product of dense U×U matrices, Θ (

متن کامل

Evaluation of M-PIRO system text output

ii Declaration I hereby declare that this MSc dissertation is of my own composition and that it contains no material previous submitted for the award of any other degree. The word reported in this MSc Dissertation has been executed by myself, except where due acknowledgement is made in the text. iii Acknowledgements I desire to express my gratitude to all those people without whom this task wou...

متن کامل

Non - Lapped Blurred Blocks Restored Output Blocks ( a ) Non - lapped decoder Output

|This letter presents an improved version of an algorithm designed to perform image restoration via nonlinear interpolative vector quantization (NLIVQ). The improvement results from using lapped blocks during the decoding process. The algorithm is trained on original and di ractionlimited image pairs. The discrete cosine transform is again used in the codebook design process to control complexi...

متن کامل

designing a reconfigurable accelerator

many of the video processing algorithms cannot be implemented in real time on general computers, due to their computational complexity. for an efficient implementation, a custom hardware that can be reconfigured for the algorithm, is used. in this paper a new acceleration hardware based on fpga elements is proposed. this hardware can be adapted with the processing algorithm through fpga design ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Journal of Solid-State Circuits

سال: 2020

ISSN: 0018-9200,1558-173X

DOI: 10.1109/jssc.2019.2960480